Stein’s method for the Poisson–Dirichlet distribution and the Ewens sampling formula, with applications to Wright–Fisher models
نویسندگان
چکیده
We provide a general theorem bounding the error in approximation of random measure interest—for example, empirical population types Wright–Fisher model—and Dirichlet process, which is having Poisson–Dirichlet distributed atoms with i.i.d. labels from diffuse distribution. The implicit metric captures sizes and locations masses, so also yields bounds on between masses interest apply result to bound stationary distribution finite model infinite-alleles mutation structure (not necessarily parent independent) by An important consequence our an explicit upper total variation distance partition generated sampling distribution, Ewens formula. small if sample size n much smaller than N1/6log(N)−1/2, where N size. Our analysis requires separate interest, giving second moment number follows new development Stein’s method for viewing process as Fleming–Viot then applying Barbour’s generator approach.
منابع مشابه
The ubiquitous Ewens sampling formula
Ewens’s sampling formula exemplifies the harmony of mathematical theory, statistical application, and scientific discovery. The formula not only contributes to the foundations of evolutionary molecular genetics, the neutral theory of biodiversity, Bayesian nonparametrics, combinatorial stochastic processes, and inductive inference but also emerges from fundamental concepts in probability theory...
متن کاملConvergence Time to the Ewens Sampling Formula
In this paper, we establish the cutoff phenomena for the discrete time infinite alleles Moran model. If M is the population size and μ is the mutation rate, we find a cutoff time of log(Mμ)/μ generations. The stationary distribution for this process in the case of sampling without replacement is the Ewens sampling formula. We show that the bound for the total variation distance from the generat...
متن کاملRejoinder: The Ubiquitous Ewens Sampling Formula
The main article and extended discussion point to Ewens’s sampling formula (ESF) as one of a few essential probability distributions. Arratia, Barbour and Tavaré explain the emergence of ESF by the Feller coupling and also touch on number theoretic considerations; Feng provides deeper background on diffusion processes and nonequilibrium versions of ESF; and McCullagh regales us with a story fro...
متن کاملGeneralization of the Ewens sampling formula to arbitrary fitness landscapes
In considering evolution of transcribed regions, regulatory sequences, and other genomic loci, we are often faced with a situation in which the number of allelic states greatly exceeds the size of the population. In this limit, the population eventually adopts a steady state characterized by mutation-selection-drift balance. Although new alleles continue to be explored through mutation, the sta...
متن کاملA finitary characterization of Ewens sampling formula
The clustering of agents in the market is a typical problem dealt with by the new approaches to macroeconomic modeling, that describe macroscopic variables in terms of the behavior of a large collection of microeconomic entities. Clustering has a lot of economical interpretations [3], that are often described by Ewens Sampling Formula (ESF). This formula can be traced back to Fisher as “species...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Annals of Applied Probability
سال: 2021
ISSN: ['1050-5164', '2168-8737']
DOI: https://doi.org/10.1214/20-aap1600